Picture for Shikun Feng

Shikun Feng

CodecCap: High-Fidelity Codec-Inspired Residual Modeling for Dense Video Captioning

Add code
May 26, 2026
Viaarxiv icon

LLaVA-OneVision-2: Towards Next-Generation Perceptual Intelligence

Add code
May 25, 2026
Viaarxiv icon

Eureka-Audio: Triggering Audio Intelligence in Compact Language Models

Add code
Feb 15, 2026
Viaarxiv icon

OneVision-Encoder: Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence

Add code
Feb 09, 2026
Viaarxiv icon

ERNIE 5.0 Technical Report

Add code
Feb 04, 2026
Viaarxiv icon

CORD: Bridging the Audio-Text Reasoning Gap via Weighted On-policy Cross-modal Distillation

Add code
Jan 23, 2026
Viaarxiv icon

MoE Adapter for Large Audio Language Models: Sparsity, Disentanglement, and Gradient-Conflict-Free

Add code
Jan 08, 2026
Viaarxiv icon

Straight-Line Diffusion Model for Efficient 3D Molecular Generation

Add code
Mar 04, 2025
Viaarxiv icon

UniGEM: A Unified Approach to Generation and Property Prediction for Molecules

Add code
Oct 14, 2024
Figure 1 for UniGEM: A Unified Approach to Generation and Property Prediction for Molecules
Figure 2 for UniGEM: A Unified Approach to Generation and Property Prediction for Molecules
Figure 3 for UniGEM: A Unified Approach to Generation and Property Prediction for Molecules
Figure 4 for UniGEM: A Unified Approach to Generation and Property Prediction for Molecules
Viaarxiv icon

Pre-training with Fractional Denoising to Enhance Molecular Property Prediction

Add code
Jul 14, 2024
Viaarxiv icon